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Abstract 



The paper establishes Horton self-similarity for tree representation of Kingman's co- 
alescent process and, equivalently, level-set tree of a white noise. The proof is based on a 
Smoluchowski-type system of ordinary differential equations for the number of Horton- 
Strahler branches in a tree that represents Kingman's coalescent. This approach al- 
lows us to establish a weak form of the Horton law, namely the existence of a limit of 
(Nk/N) 1 ^ for the number of branches of Horton-Strahler order k in a coalescent 
system of size N, as we let ./V — > oo and then k —> oo. We conjecture, based on numer- 
ical observations, that the Kingman's coalescent is also Horton self-similar in regular 

strong sense with Horton exponent R = lim I lim (Nk+i/N^) I = 0.328533... and 



asymptotically Tokunaga self-similar. Finally, we demonstrate combinatorial equiva- 
lence between the trees of a Kingman's coalescent and level-set trees of a discrete white 
noise. 

1 Introduction 

When studying the tree graphs associated with random structures one often aims at discov- 
ering a particular symmetry or a consistent pattern such as self-similarity. There exist two 
important types of tree self-similarity related to the Horton-Strahler [Til 139] and Tokunaga 
[JT] indexing schemes for tree branches (see Sect. [4] for definitions). Introduced in hydrology 
in the 1960s to describe the dendritic structure of river networks, which is among the most 
evident examples of natural branching, these indexing schemes have been rediscovered and 
used in other applied fields. The Horton-Strahler indexing assigns orders to the tree branches 
according to their relative importance in the hierarchy. For instance, in a perfect binary tree 
of depth d max the Horton-Strahler order r is uniquely determined by the node depth d: 
r = d max — d+1. Hence, all leaves have order 1 and the root has the maximal order d max + 1. 
The Horton-Strahler orders have been shown useful in diverse applications that range from 
ranking river tributaries to analysis of brain structure to designing optimal computer codes; 
see PH [391 CE2 S3 ESI E E2 S2l 12 IH E] and references therein. Quite often, observed 
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systems exhibit geometric decrease of the numbers Nk of elements of Horton-Strahler order 
k > 1; this property is called Horton self-similarity. The common ratio R of the respective 
geometric series is called the Horton exponent. In accordance with Horton-Strahler rules (see 



Sect. 4.2, Eq.(|2j)), a branch of order k > 1 can only be created by merging of two branches of 
orders (k — 1). The Horton self-similarity hence addresses the principal branching in a tree 
and presents a counterpart of the power-law element size distribution. The last statement 
becomes clear if one naturally interpret the Horton-Strahler order k as a logarithmic measure 
of element size s: k « log s. A stronger Tokunaga self- similarity addresses so-called side 
branching - merging of branches of distinct orders. The Tokunaga self-similarity implies that 
different levels of a hierarchical system have the same statistical structure, in a sense to be 
defined; it is parameterized by a positive pair (a, c). 

A classical model that exhibits Horton and Tokunaga self-similarity is critical binary 
Galton- Watson tree [29JHJET], also known in hydrology as Shreve's random topology model 

EZ] ; it is conditionally equivalent to the uniform distribution on planar binary trees with 
TV leaves [3T]. This model has R = 4 and (a, c) = (1,2). For long time, the critical bi- 
nary Galton- Watson tree has remained the only well-known probability model for which the 
Horton and Tokunaga self-similarity was rigorously established, and whose Horton-Strahler 
ordering has received attention in the literature [361 E3 El El [29], [271 II]- The model 
has been particularly popular in hydrology as an approximation to the topology of the ob- 
served river networks [40]. Scott Peckham [29] has first explicitly noticed, by performing a 
high-precision extraction of river channels for Kentucky River, Kentucky and Powder River, 
Wyoming, that the Horton exponents and Tokunaga parameters for the observed rivers sig- 
nificantly deviate from that for the Galton- Watson model. He reported values R m 4.6 and 
(a, c) ~ (1.2, 2.5) and emphasized the importance of studying broad range of Horton expo- 
nents and Tokunaga parameters. The general interest to fractals and self-similar structures 
in natural sciences during the 1990s led to a quest, mainly inspired and led by Donald Tur- 
cotte, for Tokunaga self-similar tree graphs of diverse origin. As a result, the Horton and 
Tokunaga self-similarity, with a broad range of respective parameters, have been empiri- 
cally or rigorously established in numerous observed and modeled systems, well beyond river 
networks. This includes vein structure of botanical leaves [2~4T 14*5] , diffusion limited aggre- 
gation [261 [21] , t wo dimensional site percolation [421 HQ EJH [51] , nearest-neighbor clustering 
in Euclidean spaces [45], a general hierarchical coagulation model of Gabrielov et al. [9] 
introduced in the framework of self-organized criticality, and a random self-similar network 
model of Veitzer and Gupta [44] developed as an alternative to the Shreve's random topology 
model for river networks. The increasing empirical evidence prompted the question: What 
basic probability models can generate Tokunaga self-similar trees with a range of parameters? 
An illuminating result in this direction has been obtained by Burd et al. [4]; these authors 
have demonstrated that Tokunaga self-similarity is a characteristic property of the critical 
binary branching in the broad class of (not necessarily binary) Galton- Watson processes. 
Hence, generic branching cannot explain the variability of Horton and Tokunaga parameters 
seen in nature. Recently, the authors of the current paper established Horton and Tokunaga 
self-similarity for the level-set tree representation of a homogeneous discrete Markov chain 
and infinite-tree representation of a regular Brownian motion in continuous time [49]. The 
results in [49] (i) expand the class of Horton and Tokunaga self-similar processes beyond 
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the critical binary Galton- Watson branching, since the tree representation of Markov chains 
in general is not equivalent to the Galton- Watson process [311 EH], and (ii) suggest that 
coalescent (aggregation) phenomena may be better suited for generating self-similar trees. 
This study is a first step towards exploring self-similarity of trees generated by coalescent 
processes. 

In general, inferring the dynamics, coalescence (aggregation) versus headward growth 
(branching), for a static network pattern is non-trivial and largely unsettled; see [T2 1 \TE[ ITU] 
for both models and empirical approaches. In this regard, it is of interest to analyze the 
behavior of self-similarity statistics, such as Horton ratios, for both classes of models. The 
results in this paper focus on coalescence and are quite distinct from those covered by the 
bulk of the mathematical theory to date. 

Coalescent processes have been much studied in diverse scientific well as in 

applied probability literature. We refer to David Aldous [2] for a comprehensive review 
and further references, and only mention here that coalescents offer an important tool for 
studying inheritance relationships in population genetics [151 HS1 El ED] , formation of planets 
and galaxies in astronomy [321133] , fracturing of solids [31 [31], development of drainage basins, 
including extraterrestrial ones [35, 13J, dynamics of aerosols [7J, kinetics of polymerization 
and gelation [32], algae growth [JJ, evolutionary driven aggregation of animals [28J and 
many other phenomena. Generally speaking, a coalescent describes the process of merging 
of particles into consecutively larger clusters. If the coalescence rate K(x, y) of two clusters 
depends only on their masses (sizes) x and y, then the mean-field dynamics of a coalescent 
process is described by the celebrated Smoluchowski integro-differential equation [3H1 12] (see 
also Sect. pj). The most studied in this class are the Kingman's (K(x,y) = 1), additive 
(K(x,y) = x + y), and multiplicative (K(x,y) = xy) coalescent processes, which allow 
an explicit analytic solution of the Smoluchowski equation; see [2j [25J [31] and references 
therein. Much can be learned about coalescent processes from studying the corresponding 
time-oriented tree graphs that describe the history of cluster merging. 

This paper focuses on the trees generated by a Kingman's coalescent and its finite version, 
Markus-Lushnikov coalescent process, with a finite number N of particles, as N approaches 
infinity. We establish a particular form of the Horton self-similarity, called here root-Horton 
law for the Kingman's coalescent. We also show that the tree that describes a Kingman's 
coalescent is combinatorially equivalent to the level-set tree of a white noise. These findings 
add two important classes of processes to the realm of Horton-self-similar systems. Finally, 
we perform numerical experiments that suggest that the Kingman's coalescent, and hence 
the level-set tree of a white noise, are Horton self-similar in a regular stronger sense as well 
as asymptotically Tokunaga self-similar. 

The paper is organized as follows. The main results are summarized in Sect. [2} The 
Kingman's coalescent process is introduced in Sect. [3j Section [4] introduces the concepts of 
Horton and Tokunaga self-similarity and the related Horton-Strahler and Tokunaga ordering 
schemes of tree branches. The Smoluchowski-Horton system of equations that describe the 
dynamics of Horton-Strahler branches in Kingman's coalescent is written out in Sect. [5j Sec- 
tion [6] gives the proof of the existence of root-Horton law for Kingman's coalescent, together 
with some related results. Section [JJ establishes the equivalence between the Kingman's coa- 
lescent tree and the level-set tree of a white noise. Numerical results and the corresponding 



3 



Y.V. Kovchegov and I. Zaliapin 



Horton self-similarity of Kingman's coalescent 



conjectures are collected in Sect. [8) A Smoluchowski-Horton system for a general coalescent 



process with collision kernel is written in Sect. M Section 10 concludes. 



2 Statement of results 

A trajectory of a coalescent process with a finite number of particles can be naturally de- 
scribed by a time-oriented tree T constructed as follows. Start with N leaves that represent 
the initial N particles. When two (or more) particles coalesce into a cluster, merge the corre- 
sponding leaves to form an internal vertex. Similarly, when two (or more) clusters coalesce, 
merge the corresponding vertices. The final coalescence would form the tree root. Each 
vertex within T is assigned a time of the respective coalescence. The resulting time-oriented 
tree represents the history of the process. It is readily seen that there is one-to-one map 
from the trajectories of a coalescence process with N particles onto the trees with N leaves. 
If multiple coalescences are not allowed, the corresponding trees are binary. 

In this paper we prove that the time-oriented tree generated by a Kingman's coalescent 
process is Horton self-similar. We will often rely on the fact that the Kingman's coalescent is 
a limit, in a sense to be defined, of a Markus-Lushnikov coalescent process with N particles 
and constant collision kernel as iV — > oo (see Sect. [3]). We notice also that Horton and 
Tokunaga analyses only consider the combinatorial structure (branching) of a tree and ignore 
its metric properties (in our case - the time marks). Specifically, consider the asymptotic 
proportion 

Af k = lim N k /N 

N— >oo 

of the number N k of branches of Horton-Strahler order k in the Markus-Lushnikov coalescent 
of size N with constant collision kernel. Our construction of infinite tree in Sect. [7] allows 
one to interpret A4 also as the proportion of branches of order k in the infinite tree that 
corresponds to the Kingman's coalescent. 
We show in Sect. [5] that 

1 ,2/ 



where the sequence g k {x) solves the following system of ordinary differential equations: 

cP* {x\ 

9k+i(x) ~ ^j 1 + 9k(x)g k+1 (x) =0, x > 

with gi(x) = 2/(x + 2), ^(0) = for k > 2. We also establish the following equivalent 
relation: 



A4 = / [1 - (1 - x) /i fc -i(a;)] 2 dx, 
Jo 



where 

K+i( x ) + h\{x) - 2h k (x)h k+1 (x) = 0, x e [0, 1] 
with ho = 0,h± = 1, and hk(0) = 1 for k > 1. The following theorem is proven in Section [6j 
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Theorem [TJ The tree associated with a Markus-Lushnikov coalescent with N particles 
and constant collision kernel is asymptotically Horton self-similar as N — > oo. Namely, 
lim (7V fc )^ = R, with l/A<R< 1/2. 

k— >oo 

In Section [7] we establish the following equivalence that expands the above Horton self- 
similarity to the level-set tree representation of time series of i.i.d. random variables, i.e. the 
white noise. 

Theorem [2} Level-set tree of a discrete white noise of length N is combinatorially equivalent 
to the tree generated by a Markus-Lushnikov coalescent process on N particles with constant 
collision kernel. 

Construction of an infinite tree for Kingman's coalescent as a limit of finite trees for 
Markus-Lushnikov process of increasing size (see Sect. [7]) leads to the following result. 

Corollary. The infinite tree generated by the Kingman's coalescent is Horton self-similar, 
in the sense defined in Theorem^ 

Finally, our numerical experiments in Sect. [8] suggest that the tree corresponding to a 
Kingman's coalescent obey a stronger version of Horton as well as asymptotic Tokunaga 
self-similarity. In addition we estimate the Horton exponent R and Tokunaga parameters 
(a, c). The following conjecture summarizes the results of our numerical experiments. 

Conjecture. The tree associated with Kingman's coalescent process is Horton self-similar 
with 

lim j" 1 = lim (A4)^ = R and lim — \ = const., 

k— >oo JXf. k— >oo k— >oo H, k 

where R = 0.328533 . . . and Tokunaga self-similar, asymptotically in k: 



lim T ii+k =: T k and lim 

i— >oo 

for some positive a and c. 



fc-p-oo C k 1 



3 Kingman's coalecent 

We start by considering a general finite coalescent process defined by collision kernel [3IJ, also 
called Marcus-Lushnikov process [21 [201 HH] ■ The process begins with N particles (clusters) of 
mass one. The cluster formation is governed by a collision rate kernel K(i,j) = K(j,i) > 0, 
1 — hj — N — 1. Namely, a pair of clusters with masses i and j coalesces at the rate 
K(i,j)/N, independently of the other pairs, to form a new cluster of mass i+j. The process 
continues until there is a single cluster of mass N. 

Formally, for a given iV consider the space of non-negative partitions 

N 

= {(^1, • • • , n N ) such that Ui = N}. 
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The Markus-Lushnikov process ML^(t) with a positive symmetric collision kernel K(i,j) 
is a continuous-time finite-state Markov chain on partitions U^. The process only can make 
transitions of the form 

(ni, . . .,n N ) -)■ (m, . . . ,Ui - 1, . . . ,rij - 1, . . .,n i+j + 1, . . .,n N ) 

at homogeneous transition rates K(i,j) UiUj/N. Here nu represents the number of clusters 
of mass k in the Marcus-Lushnikov process. 

The above construction uses different state space for each N. To define coalescence on 
the same state space, which is useful for studying the limit behavior as N — > oo, we introduce 
infinite partitions 

oo 

X = {(xi > Xi ■ ■ • > 0) such that Xj = 1}. 

i=i 

A stochastic coalescent X(t) = (Xi(t),i > 1) is, informally, a continuous-time Markov chain 
with transitions 

(xi, . . . , Xj_x, Xj, Xj_|_i, . . . , Xj_i, Xj, Xj_|_i, • • • ) y 

(xi, . . . , Xi -\- Xj, • • • , Xj_i, Xj_|_x, . . . , Xj'— i, Xj'4-i, . . . ) 

that happen at homogeneous rates K(xi,Xj). We refer to [21 [31] for a formal definition. In 
case of a homogeneous kernel 

K(cx, cy) = c 7 K(x, y), < c, x, ?/ < oo, 7 > 0, 

we have [2] 

Xi(t) = A/" -1 ML^ (t iV 1-7 ) , 

where ML^' (t) denotes the mass of the ith largest cluster in the Markus-Lushnikov process. 

The Kingman's coalescent K(t) can be defined (2] as a unique version of the stochastic 
coalescent with kernel K(i,j) = 1 and the initial condition 

lim(a.s.)Xi(i) = 0. 

The Kingman's coalescent K(t) is a unique limit of the Markus-Lushnikov process as N — > 00. 
Formally [2], if 

N' 1 ML[ W, (0) A 0, as iV 00 

then 

jiV" 1 ML (iV) (iVt),0 < t < 00} A {K(i),0 <t < 00}. 

The history of a coalescent process can be naturally described by a binary tree with iV 
leaves and N — 1 internal vertices, each of which corresponds to an individual collision. We 
study here the self-similarity of this tree as iV — > 00. 
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Let Xj.(t) be the mean number of clusters of mass k per unit volume at time t > in a 
coalescent process with collision kernel K(i,j). Then, at time t > 0, the average number of 
pairs of clusters coalescing into a cluster of mass k per unit time per unit volume is 



fc-i 

-J2K(j,k-j)X,(t)X k ^(t), 



3=1 

and the average number of clusters of mass k coalescing with clusters of mass ^ k per unit 
time per unit volume is 

X k (t) K(k,j)Xj(t). 

Finally, the average number of clusters of mass k coalescing with clusters of the same mass 
k is 



lim — 

7V->oo N 



NX k {t)\K(k,k) 



N 



X 2 k (t)K(k,k). 



The Smoluchowski coagulation equations describe evolution of Xkit) for k > 1 as iV — )■ oo 
via a system of ODEs 



d_ 
dt 



fe— 1 oo 

X k (t) = -J2 K (j:k-j)X j (t)X h _ j (t)-X h (t)J2 K (kJ)X j (t) 



(1) 



3=1 



3=1 



with initial conditions X k (0) = Si(k), using Kronecker 5i(k); see [2] and references therein. 
The Smoluchowski ODEs account for the number of clusters of masses j and k — j that 
coalesce to new clusters of mass k, and for the clusters of mass k that coalesce with other 
clusters to form clusters of masses exceeding k. We will employ a similar approach in the 
case of Smoluchowski- Horton ODEs for Kingman's coalescent in Section [5j 



4 Self-similar trees 

This section describes the essential types of self-similarity for trees based on the Horton- 
Strahler and Tokunaga schemes for ordering nodes in a rooted binary tree. Historically, 
the presented approach was introduced by Horton [11] for ordering hierarchically organized 
river tributaries; the methods was later refined by Strahler [39] and further expanded by 



Tokunaga [JT] to include so-called side-branching (see Sect. 4.2 below). 



4.1 Planar rooted trees 

A graph Q = (V,E) is a collection of vertices (nodes) V = {fj}, 1 < % < Ny and edges 
(links) E = {e^}, 1 < k < N E . In a simple graph each edge is defined as an unordered pair 
of distinct vertices: VI < k < N E , 3! 1 < i,j < N v , i ^ j such that e k = {v^Vj) and we say 
that the edge k connects vertices V{ and Vj. Furthermore, each pair of vertices in a simple 
graph may have at most one connecting edge. A tree is a connected simple graph T = (V, E) 
without cycles. In a rooted tree, one node is designated as a root; this imposes a natural 
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direction of edges as well as the parent-child relationship between the vertices. Specifically, 
of the two connected vertices the one closest to the root is called parent, and the other - 
child. We consider planar (or labeled) trees, meaning that we will distinguish the children of 
the same parent. The total number of distinct trees with n leaves, according to the Cayley's 
formula, is n n ~ 2 . The total number of distinct binary trees with n leaves is given by the 
(n — l)-th Catalan number [31] 

1 / 2n-2 
n \ n — 1 

4.2 Horton-Strahler and Tokunaga order schemes 

The Horton-Strahler ordering of the vertices of a finite rooted labeled binary tree is performed 
in a hierarchical fashion, from leaves to the root [21J IH El EH]: (i) each leaf has order 
r(leaf) = 1; (ii) when both children, ci,C2, of a parent vertex p have the same order r, the 
vertex p is assigned order r(p) — r + 1; (iii) when two children of vertex p have different 
orders, the vertex p is assigned the higher order of the two. Figure [T|a) illustrates this 
definition. Formally, 

r(p) = l r(ci) + 1 if r(ci) = r(c2) ' (2) 

' 1 max {r(ci), r(c 2 )} if r(c\) ^ r(c 2 ). 

A branch is defined as a union of connected vertices with the same order. The branch 
vertex nearest to the root is called the initial vertex, the vertex farthest from the root is called 
the terminal vertex. The order Q(T) of a finite tree T is the order of its root, or, equivalently, 
the maximal order of its branches (or vertices). Let N r denote the total number of branches 
of order r. An equivalent, and intuitively more appealing, definition of the Horton-Strahler 
orders can be given using the operation of pruning; see [U EHl EH] . 

The Tokunaga indexing [2JJ HTJ EH] extends upon the Horton-Strahler orders; it is illus- 
trated in Fig. [T]d. This indexing focuses on side-branching, which is the merging between 
branches of different order. Let r^- , 1 < k < Nj, 1 < i < j < Q denotes the number of 
branches of order i that join the non-terminal vertices of the k-th branch of order j. Then 
Nij = t-j \ i < j, is the total number of such branches in a tree T. The Tokunaga index 
Tij = Nij/Nj is the average number of branches of order i < j per branch of order j in a 
finite tree of order Q > j. 

In a probabilistic setting, one considers a space of finite binary trees with a suitable 
probability measure. Then, iVj, N^, and r^- become random variables. We notice that if, 

for a given {ij}, the side-branch counts are independent identically distributed random 

variables, t-^ = t^, then, by the law of large numbers, 

E (nj) as Nj oo, 

where the almost sure convergence X r —4 fx is understood as P lim X r = li) = 1 . 

For consistency, we denote the total number of order-i branches that merge with other 
order-i branches by N& and notice that in a binary tree Nu = 2Ni + \. This allows us to 
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formally introduce the additional Tokunaga indices: = Nn/N i+1 = 2. The set {T^}, 
1 < % < Q — 1, 1 < j ' < n, i < j of Tokunaga indices provides a complete statistical 
description of the branching structure of a finite tree of order Q. 

Next, we define several types of tree self-similarity based on the Horton- Strahler and 
Tokunaga indexing schemes. 



4.3 Horton and Tokunaga self-similarity laws 

Let Mk define the asymptotic ratio of the branches of order k in a finite rooted tree T to the 
number N of the tree leaves defined as an almost sure limit when iV — > oo: 

M k = lim — 

Horton self- similarity implies that the sequence M k decreases in a regular geometric fashion 
with k. Formally, we define three types of Horton self-similarity. 

Definition 1. A finite rooted tree T is said to obey a Horton law if and only if at least one 
of the following limits exists and is finite and positive: 

(a) root law : lim [Mk ) * — R> 0, 

(b) ratio law : lim ff k = R > 0, 

M k 

(c) geometric law : lim — = iVo > 0. 

fc— >oo R k 

The constant R is called the Horton exponent. 

It is easily seen that the three versions of the Horton law are consecutively more stricter: 
(c) =^ (b) =^ (a). The ratio and geometric Horton laws are usually considered in the literature 
(e.g. , [291 H31 HB H2 El [22j H9] ) . For instance, a well-known result is that a tree corresponding 
to a critical binary Galton- Watson process, also known as Shreve's random topology model, 
obey the geometric Horton law with R = 4: see [351 [Til W7[ 1291151 HI 135]. 

A property that is more restrictive than a Horton law is conventionally required though 
to call a tree self-similar. 

Definition 2. In a deterministic setting, a tree T of order Q is called self-similar if its 
side-branching structure (i) is the same for all branches of a given order j: 

4 fc)=:7 y> ±<k<Nj, i<i<j<n, 

and (ii) is invariant with respect to the branch order: 

Ti(i +k ) = T i(i+k) =:T k for 2 < i + k < fi. (3) 
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Definition 3. A self-similar tree is called Tokunaga self-similar if it obeys an additional 
constraint first considered by Tokunaga |41j : 



T fc+ i/T fe = c <£> T k = a c"' 1 a, c> 0, 1 < k < O - 1. 
For a tree satisfying the ratio-Horton and Tokunaga laws, one has [HJ 129] : 

2 + c + a+ v/(2 + c + a) 2 -8c 

-fi — z ■ 



(4) 



(5) 



McConnell and Gupta |22] have shown that the ratio Horton law holds in a self-similar 
Tokunaga tree. Zaliapin jl8] has shown, moreover, that geometric Horton laws hold in a 
Tokunaga tree and, at the same time, even weak Horton laws may not hold in a general, 
non- Tokunaga, self-similar tree. 

--■ T k for 



Jj) 

! i{i+k) 



In a random setting, we say that a tree T of order Vt is self-similar if E 

1 < 3 < 2 < i + k < Q; and it is Tokunaga self-similar if, furthermore, the condition 

Q holds. 



5 Smoluchowski-Horton ODEs for Kingman's coales- 
cent 

Consider a Marcus-Lushnikov coalescent process with a constant kernel starting with N 
particles. We will write Smoluchowski-type ODEs for the Horton-Strahler branches of the 
tree that describes the process and consider asymptotic version of this equation as N — > oo. 
We call the resulting system the Smoluchowski-Horton equations. 



5.1 Main equation 

Let r](t) be the mean number of clusters (of any mass) at time t > per unit volume. Thus 
77(0) = N/N = 1 and rj(t) decreases with each coalescence of clusters, at the rate of 

1 fN V (t)\ V \t) 
hm — 

V-!.oo N 2 

per unit volume, as 1/N is the coalescence rate for any pair of clusters regardless of their 
masses. Thus 77 (i) satisfies the following ODE 

> ~ "f ■ «o 

For any j G N we define t]j(t) to be the mean number of clusters of Horton-Strahler order 
j per unit volume at time t. Initially, each particle represents a leaf of Horton-Strahler order 
1. Thus, the initial conditions are set to be, using Kronecker's delta notation 
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Recall that according to the definition of Horton-Strahler order, connecting two branches 
of orders i and j, where i ^ j, would result in a branch of order max{i, j}. Connecting two 
branches of the same Horton-Strahler order, % = j, would result in a branch of order j + 1. 
The evolution of Horton-Strahler orders is hence very different from the evolution of cluster 
sizes. We describe this evolution using ODEs as follows. 

Observe that at any time t, the average number of pairs of clusters of Horton-Strahler 
order j — 1 that coalesce into clusters of order j, per unit time per unit volume is 

tv^oo N 2 \ 2 J 2 

Thus Vj -^ is the instantaneous rate of increase of r]j(t). 

The number of clusters of order j will decrease by one if a cluster of order j will coalesce 
with a cluster of order strictly higher than j, and will decrease by two if a cluster of order 
j will coalesce with another cluster of order j. The average number of clusters of order j 
coalescing with clusters of order strictly greater than j per unit time per unit volume is 



v 3 (t) \v(t)-J2vk(t)), 



k=l 

and the average number of pairs of coalescing clusters of order j per unit time per unit 
volume is 

1 (N Vj ( t )\ = vM 
n^oo N 2 \ 2 J 2 

Thus the instantaneous rate of decay of r]j(t) is 

vAt) ^(0-5>k(f)) +v-(t). 

Now we can write the rates-in and the rates-out for the clusters of Horton-Strahler order 
via the following Smoluchowski-Horton system of ODEs: 



d 



dt 



(t) = r ^--v j (t) Ut) - £ m(t)) (7) 

V k:k<j J 



with the initial conditions ^(0) = Si(j) stated earlier. Here we let 770 = 0. 

We showed that r)j(t) has the instantaneous rate of increase - ■ The total per volume 
number of clusters of Horton-Strahler order j is therefore 

M = Si(j) - 



In a tree generated by the Kingman's coalescent process, the above quantity Mj represents 
the asymptotic ratio for the number of branches of order j relative to the total number of 
leafs N. 
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It is not hard to compute the first few terms of sequence A4 by solving equations (|6| and 
([7]) in the first three iterations: 

1 e 4 e 2 233 

A/i = l, A/" 2 = -, and A/" 3 = —-- + — = 0.109686868100941... 

o iZo o oo4 



Hence, we have M 2 /Mi = 1/3 and A/" 3 /W 2 = 3x (f^-y + HJ = 0.3290606043028 . . . 
Our numerical results in Sect. [8] yield, moreover, 

lim (A4)^ = lim = 0.328533 . . . 

In the next section we prove the existence of the asymptotic limit lim (7\4) E for the root- 

fc— >oo 

Horton law. 



6 Existence of the root-Horton law and related results 

Let g\{t) = r)(t) and gj(t) = rj(t)— ^ Vk(t) be the asymptotic number of clusters of Horton 

k: k<j 

order j or higher at time t. We can rewrite ^ via gj using r)j(t) = gj(t) — gj+i(t) as follows 

j t 9 3 (t) - j t 9 j+ i(t) = 9i ~ l{t) ~ 9i{t) - ( 9j (t) - g j+1 (t)) 9j (t) 
Observe that gi(t) > g%(f) > 9 3{t) > ••• We now rearrange the terms, obtaining for all 

i>2, 

j t 9j+i(t) - + 9i(t)g j+ i(t) = j t 9At) - + gj-i{t)gj(t)- ( 8 ) 

One can readily check that 4j. 9 2(t) — ^r^ + <?i(£) gz(t) = 0; the above equations hence simplify 
as follows 

g' J+1 (x)- 9 ^P-+g J (x)g 1+1 (x) = (9) 
2 

with gi(x) = —-7,, and ^ (0) = for j > 2. 

2 / \ 

Notice that the above system of ODEs (9 ) is the quasilinearised Riccati equation g'(x) = —^-ip- 
with the initial value g(0) = that has only a trivial solution. 

Next, returning to the asymptotic ratios of the number of order- j branches to N, we 
observe that (|8l implies that 
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since 

,yj\ L ) "r 



dr JK ' 2 



oo 



where f ^gj(t)dt = gj(oo) — g(0) = for j > 2. Let n& represent the number of order-A; 



o 



branches relative to the number of order-(fc — 1) branches: 



oc 



f^dx II 112 

Mk o \\9k\\ L 2[ 0oo ) 



A4-1 J ti-lW fa H^-l|li2[0,oo) 





Consider the following limits that represent respectively the root and the ratio asymptotic 
Horton laws: 



\n II 2 

|yfc|lL 2 [0,oo) 



lim (A4) k = lim ( TT rij I and lim = lim 

k— >oo k— >oo \ / 7— >oo k— »oo 



We expect both the limits to exist and to be equal to 0.328533 . . . according to our numerical 
results. We now establish some basic facts about gj and rij. 



Proposition 1. Let gj(x) be the solutions to the systems of ODEs Q). Then 

°° g 2 {x) °° 

( a ) / ^rdx = J gj(x)g j+ i(x)dx, 



oo oo 

( b ) I gj+i( x ) dx = I(gj( x ) - gj+i(x)) 2 dx, 





(c) lim xgj(x) = 2, 



(r\\ rt ■ - llgill ^ 2 [°-°°) < 1 
W U 3 ~ \\9i-i\W, S 2' 

" yj i|l i 2 [0,oo) 



(e) n = „" gill gP»-) > \. 

llyj "L 2 [0,oo) 



Proof. Part (a) follows from integrating (|9]), and part (b) follows from part (a). Part (c) is 
done by induction, using the L'Hopital's rule as follows. It is obvious that lim xg\{x) = 2. 

x— >oo 

Next, suppose lim xgAx) = 2. Then 

x— >oo 

r r\ v 9j+i( x ) r ^ ~ 9j(x)gj+i(x) 

lim xgj + i{x) = lim _ — = lim — — = 2 lim xgj + i{x) — 2 

x— >oo x^oo x x— !>oo — X x— >oo 
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implying lim xgj+%(x) = 2. The statement in part (d) follows from the tree construction 

x— >oo 

process. A proof of part (d) using differential equations is given in the following subsection. 
Part (e) follows from part (a) together with Holder inequality 

oo 

^INIl 2 [0,oc) = J 9j{x)g j+ l{x)dx < ||^|| L 2[0,oo) • ||&+l|U 2 [0,oo), 


which implies ll f + ,;" L2[0 '° o) > §. 

□ 



6.1 Rescaling to [0, 1] interval 

Let 

h k (x) = (1 - x)~ l - (1 - x)~~ 2 g k +i 



2x 
1 — x 



for x G [0, 1]. Then h = 0, hi = 1, and the system of ODEs ^ rewrites as 

K+i( x ) = ^h k (x)h k+l {x) - h 2 k (x) (10) 

with the initial conditions h k (0) = 1. 

The above system of ODEs (10) is the quasilinearised Riccati equation h'(x) = h 2 (x) 
over [0, 1), with the initial value h(0) = 1. Its solution is h(x) = j^— . Here 



h k (x) — > h(x) 



1 — x 



and n k 



1 - h k+l /h\ 



L 2 [0,1] 



|l - h k /h 



|2 

Il 2 [o,i] 



Observe that h,2(x) = (1 + e )/2, but for k > 3 finding a closed form expression becomes 
increasingly hard. Given h k (x), Eq. (10) is a linear first-order ODE in h k+ i(x); its solution 
is given by h k+ i(x) = %h k (x) with 



Hf{x) 



x -2ff(s)ds 

f(y)e dy 



2jf(s)ds 

e 



Hence, the problem we are dealing with concerns the asymptotic behavior of an iterated 
non-linear functional. 

Using the setting of (10), we give an ODE proof to Proposition [T](d). To do so, we first 
need to prove the following lemma. 



Lemma 1. 



|l - h k+ i/h\ 



L 2 [0,1] 



\h k+ i/h - h k /h\ 



L 2 [0,1] 
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Proof. Observe that 

h 'k+i( x ) + ( h k+i{x) - h k (x)) 2 = h\ +1 (x). 
We now use integration by parts to obtain 



i 

/ 



(h k+ x(x) - h k (x))' 



h 2 (x) 



dx 



1 i 

2 



h 2 k+ i(x) f h' k+1 (x) 

ax — — h ; ax 



h 2 (x) 



h 2 {x) 



dx 



h 2 (l) h 2 {0) 



h 2 (x) 

h k +i{x) 
h(x) 



dx 



h kMdx + l-2 f^-dx 



h 2 {x) 



(l-h k+1 (x)f 
h 2 (x) 



dx 



since l/h(x) = 1 — x. 



□ 



Alternative proof of Propositioned). Notice that h> ■■■ > h k +i > h k > ■ ■ ■ > h = 0, 
which follows from g\ (t) > gi (t) > g% (t) > . . . The Lemma [I] implies 



|l - h k+1 /h\\ L2[QA] = \\h k+ i/h - h k /h\\ L2[0 ^ =/[(!- WO ~ ( x ~ ^fc+iA)] 



I I Z II I I Z I 

1 - h k+1 /h\\ L2[0 ^ + ||1 - h k /h\\ L2lni] -2 / (1 - h k /h)(l - h k+1 /h)dx 



\L*[0,1] 



and therefore 



1 - h k /h\\ L2[0 ^ = 2 / (1 - h k /h)(l - h k+1 /h)dx 



2||l - ^fe+iA|| Z 2 [0il] + 2 / {h k+ i/h - h k /h)(l - h k+l /h)dx > 2||l - /i fe+ i//i|| r: 



L 2 [0,1] 



yielding i > 



£ 2 [0,1] 



1 — ftfe //l 



rij +1 as in Proposition 



i 2 [0,l] 



;d). 



□ 



It is also true that one can improve Proposition |T|(d) to become a strict inequality since 
one can check that in fact the inequalities on h k are strict: 



h > ■ ■ ■ > h k +i > hk > • ■ ■ > ho = 0. 
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6.2 Proof of the existence of the root-Horton limit 

Here we present the proof of our main Theorem [TJ It is based on the following two lemmas, 
Lemma [2] and Lemma |3j that will be proven in the following two subsections. 

Lemma 2. If the limit lim exists, then lim (A/l)* = lim I TT n, I also exists, and 



lim {M k y = lim ( — ^tY = lim 



Lemma 3. T/ie Kmrf lim fe ^^ 1 ' > > 1 exists, and is finite. 



fc-5-00 

k 



Theorem 1. J7ie Zirmi lim (J\f k ) k = lim I TT n; | exists and is equal to lim , fcfc( - 1 ,^ • 

Proof. The existence and finiteness of lim ^j^y shown in Lemma |ij is the precondition for 
Lemma 2 that in turn implies the existence and finiteness of the limit lim as needed 

fc— s>oo 

for the root-Horton law. □ 

6.3 Proof of Lemma [2] and related results 
Proposition 2. 

||l - h k+ i(x)/h(x)\\ 2 L2[01] < J^-JJj - II 1 ~ h k( X )/ h ( x )\\ 2 L 2 [0>1 y 

Proof. Integrating from to 1 both sides of the equation 

£h k+1 (x) _ _ (h k+ i(x) - h k (x)) 2 
h l+i( x ) h l+A x ) 

we obtain 7 -^ 7TT = } i^±^hM^ dx as h ( ) = 1. 



Hence, 



1 1 

1 _ f (hk+l(x) ~ h k( X )) 2 dx > f ( h k+l{x) - h k( x )) 2 dx _ f f 1 h k+1 (x) ' 



/i fc+ i(i) y ^Li(^) y ^ 2 (^) J \ H x 



by Proposition [TJ proving the first inequality. 

Now, 



(x)/h(x^ " 



h (!) = II 1 - ^W/^+iWll^^!] < ||1 - h k (s 



L 2 [0,l] 



thus completing the proof. □ 
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Proof of LemmalR If the limit lim exists and is finite, then lim (7— Wr ) must also 

exist and be finite. Hence the existence and finiteness of 



1 

2 \ k 



lim (jV fc )* = lim ( / (l - dx 

k^oo k^oo \J Q y h{X) J 

follows from Proposition |2j □ 
6.4 Proof of Lemma [3] and related results 

In this subsection we use the approach developed by Drmota [8] to prove the existence 
and finiteness of lim h} l + )[\ > > 1. As we observed earlier this result is needed to prove the 

fc^oo MD - 



k 



existence and positivity of lim (A4) k = lim \ Y[ n j ] > the root-Horton law. 

k— >oo k— >oo y j=\ ' 

Definition 4. Given 7 G (0, 1]. Let 

V k,y( X ) ~~ 



1 for < x < 1 - 7, 



7~ 1 /tfc f ^-^-T) ) forl-7<x<l. 



Note that sequences of functions h k (x) and Vfe j7 (x) can be extended beyond x = 1. 
Here are some observations we make about the above defined functions. 

Observation 1. V kn (x) are positive continuous functions satisfying 

= 2V k+ln {x)V kn {x) - V k 2 y (x) 
for all x G [0, 1] \ (1 — 7), with initial conditions Vfc )7 (0) = 1. 

Observation 2. Let 71, = r^^r - Then 

,K M-i(i) 

^, 7fc (l) = ^ +1 (1) (11) 

and 

Vfc, 7 (l) = 7~ 1 /i fe (l) > /ife + i(l) whenever 7 < 7^. (12) 

Observation 3. 

Vfc, 7 (x) < V k+la {x) 
for all x G [0, 1] since h k (x) < h k+1 (x). 

Observation 4. Since h\(x) = 1 and 71 = ^4jy, 



1 



/i 2 (x) < Vi >7l (x) 



for < x < 1 - 71, 
7 - 1 = / l2 (l) for 1 -71 < a; < 1. 
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The above observation generalizes as follows. 
Proposition 3. 



hk+i(x) < V knk (x) 



for < x < 1 - 7fe, 



lk 1 hk( x ^p A ) for l-j k <x<l. 



In order to prove Proposition [3] we will need the following lemma. 

Lemma 4. Functions V ka (x) and h k+ i(x) have at most one positive intersection for x > 7 
and k > 1. 

Proof. This is a proof by induction with base at k — 1. We need to show that if V kn (x) and 
h k+ i(x) have at most one positive intersection, then so do Vk+i }1 (x) and hk+2(x). Since both 
sequences of functions satisfy the same ODE relation (see Observation 1), we have 



d_ 

dx 



-2 J h k+1 (y)dy 

(Vk+i„(x) - h k +2{x)) ■ e ^ 



-2 / h k+1 (y)dy 

= {2V k+ i n {x) - V kn {x) - h k+ i(x)) ■ {Vk,j(x) - h k+ i(x)) ■ e l ~i , 

where h k+ i(x) < V k+ \ n {x) by definition of Vk+x )7 (x), and V ka (x) < Vk+x t7 (x) as in Observa- 
tion 3. 

Now, let 

r -2 f h k+1 (y)dy 

I(x) := / (2Vfc+i i7 (s) - V kr/ (s) - h k+ i{s)) ■ {V kjJ {s) - h k +i{s)) ■ e ^ ds. 

1-7 

Then 

x 

—2 f 7ife.fi (y)dy 

(Vk+i^ix) - h k+2 (x)) ■ e ^ = Vfc+i i7 (l - 7) - h k+2 (l - 7) + I{x). 



The function 2V k+ i n (x) — V k ^(x) — h k+1 (x) is positive, and since V k)7 (x) and h k+1 (x) have 
at most one positive intersection, then I(x) should change the sign from positive to negative 
at most once as x grows from 1 — 7 to 1. Hence 

x 

2 / h k+1 {y)dy 

V k+lr/ (x) - h k+2 (x) = (V k+lr/ (l - 7) - h k+2 (l - 7) + I(x)) ■ e l -~> 
should change the sign from positive to negative at most once as 

Vfc+i, 7 (l - 7) = h(l - 7) > h k+2 (l - 7)- 

□ 
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Proof of Proposition^ Lemma [4] implies that h k +i(x) and Vk rjk (x) should have no more 
than one positive intersection within the interval [1 — 7^, 1]. Hence, this unique positive zero 
is located at x — 1, where hk+i(l) = Vfc i7fe (l). Therefore, Vfc )7fc (l — 7^) > hk+i(l — 7fc) implies 
hk+i(x) < Vfc i7fc (x) as in the statement of the proposition. □ 

Now we are ready to prove the monotonicity result. 

Lemma 5. 

Ik < lk+i for all fceN. 
Proof. We prove it by contradiction. Suppose 7^ > 7^+1 for some fceN. Then 



Vfc, 7fc (a;) < V k , Jk+1 (x) 



1 

l-x 



for < x < 1 - 7fc+i, 



and therefore 

/ifc+i(x) < V km {x) < V fe)7fe+1 (x) < Vfc+i i7fc+1 (x) 

as /ifc + i(a;) < Vfc i7fc (x) by Proposition [3] 
Recall that for x £ [1 — 7^+1, 1], 

where at 1 — 7^+1 we consider only the right-hand derivative. Thus for x 6 [1 — 7*1+1, 1], 



c/.r 



Vk+i,-y k+1 (x) - h k+2 (x)j = A(x) + B(x)\V k+lm+1 (x) - h k+2 (x)y 



where A(x) = 2V k+ i^ k+1 (x)-Vk, 7k+1 (x)-h k+ i{x) > 0, B(x) = 2h k +i(x) > 0, and V k+ i ak+l (1- 
7 fc+ i) - /i fe + 2 (l - 7fe+i) = h(l _ 7fc+i) - ^+2(1 - 7Jfc+i) > 0- Hence 

Vfc+i, 7fc+1 (l) - h k+2 (l) > Vfc+i i7fe+1 (l - 7 fc+ i) - ^+2(1 - 7*+i) > 

arriving to a contradiction since V k+ i t7k (1) = /ife+ 2 (l)- □ 

Corollary. Limit lim 7^ exists. 

k— >oo 

Proof. Lemma [5] implies 7^ is a monotone increasing sequence, bounded by 1. □ 
Proof of Lemma\3^ Lemma |ij follows immediately from an observation that = □ 

7 Relation to the tree representation of white noise 

This section establishes equivalence between the Kingman's coalescent and the level-set tree 
of a white noise. We consider separately the finite case (Markus-Lushnikov process) and 
infinite case (Kingman's coalescent). 
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7.1 Finite case 

We start with recalling basic facts about tree representation of time series. For details 
and further results see [49J. Consider a finite time series Xk with discrete time index k = 
0,1,..., N. Let X t (also denoted X(t)) be time series with continuous time t 6 [0, N] 
obtained from X k by linear interpolation of its values. The level set C a (X t ) is defined as 
the pre-image of the function values above a: 

C a (X t ) = {t : X t >a}. 

The level set C a for each a is a union of non-overlapping intervals; we write \C a \ for their 
number. Notice that (i) \C a \ = \Cp\ as soon as the interval [a, (3] does not contain a value 
of local minima of X t , (ii) \C a \ > \Cp\ for any a > (3, and (iii) < \C a \ < n, where n is the 
number of the local maxima of X t . 

The level set tree level(JQ) describes the topology of the level sets C a as a function of 
threshold a, as illustrated in Fig. [2j Namely, there are bijections between (i) the leaves of 
level(X 4 ) and the local maxima of X t , (ii) the internal (parental) vertices of level(X 4 ) 
and the local minima of X t (excluding possible local minima at the boundary points), and 
(iii) the edges of level(Xj) and the first positive excursions of X(t) — X(ti) to right and 
left of each local minima t{. The leftmost and rightmost edges of level (JQ) may correspond 
to meanders, that is to a positive segments of X(t) — X(ti), rather than to excursions. It 
is readily seen that any function X t with distinct values of the local minima corresponds to 
a binary tree LEVEL(X t ). Please see [1H] for a discussion of some subtleties related to this 
construction as well as for further references. 

Theorem 2. The level-set tree of a white noise of length N < oo with discrete time and 
continuous values is combinatorially equivalent to the tree generated by a Markus-Lushnikov 
coalescent process with kernel K(i,j) = 1 on N particles. 

Recall that the combinatorial equivalence refers here to considering only branching struc- 
ture of a tree and ignoring the time marks. 

Proof. The proof uses the duality between coalescence and fragmentation processes [2]. 
Specifically, a coalescence process on N particles with time-dependent collision kernel K t (x, y), 
1 < x,y < N is equivalent, upon time reversal, to a discrete-mass fragmentation process of 
initial mass iV with some splitting kernel S t (m,x), l<m<N,l<x<N. Recall that a 
fragmentation process starts with a single cluster of mass N. Each existing cluster of mass 
m splits into two clusters of masses m — x and x at the splitting rate S t (m,x). See Aldous 
[2J for further details and the relationship between the dual collision and splitting kernels in 
general case. 

The Marcus-Lushnikov coalescence with time- independent kernel K(x, y) = 1 is dual to 
the fragmentation process with splitting kernel [2] Table 3] 



St(m,x) = — r. 

v ' ; t{t + 2) 

This kernel is constant with respect to the cluster mass, which means that the splitting of 
mass m is uniform on the interval [1, ...,m — 1]. The time dependence of the kernel does 



20 



Y.V. Kovchegov and I. Zaliapin 



Horton self-similarity of Kingman's coalescent 



not affect the combinatorial structure of the fragmentation tree (and can be removed by a 
deterministic time change.) 

The level-set tree LEVEL(Xt) also can be considered as a tree for a fragmentation process 
with the initial mass iV equal to the length of the time series. Indeed, by construction, each 
subtree of level(X 4 ) with n leaves corresponds to an excursion (or meander, if we treat 
one of the boundaries) of length n. This subtree (as well as the corresponding excursion or 
meander) splits into two by the minimum of X t at the corresponding interval. For a white 
noise, the location of the minimum is uniform within the interval, so is the corresponding 
splitting kernel. Hence, the level set tree level (JQ) has the same combinatorial structure 
as the tree for a Markus-Lushnikov coalescent with constant kernel. □ 

Remark 1. We notice that the dual splitting kernels for multiplicative and additive coales- 
cences [21 Table 3] only differ by their time dependence, and are equivalent as functions of 
mass. Hence, the combinatorial structure of the respective trees is the same. 

7.2 Infinite case 

To treat infinite case, we recall the geometric representation of the Kingman's coalescent 
[21 Construction 2]. Start with placing a unit mass at each integer i 6 Z; these points 
represent initial particles. Clusters are formed by appearance of links of the form + 1) 
that connect initial masses to consecutively larger clusters. Each link appears at a specified 
time-dependent space-independent rate, independently of the other links; the particular 
form of time-dependence is not important for our analysis. We notice that this process is 
combinatorially (not considering time, only the order of link appearance) equivalent to the 
level-set cluster formation in a white noise. Specifically, we associate each link + 1) 
with the ith value of the white noise. The order of link appearances is now governed by the 
respective value of the white noise: higher values appear earlier than lower ones. Hence, the 
Kingman's coalescence process within each finite interval (in its geometric interpretation) 
can be described by a tree that is combinatorially equivalent to a tree of a finite white noise. 

An infinite tree that represents infinite white noise can be constructed as follows. 
Choose a local maximum of the white noise W(t). The interval iibetween two consecutive 
local minima adjacent to the chosen local maximum correspond to a leaf of (according 
to the level-set tree construction described above). We denote this leaf by T\. Now extend 
the interval l\ to the interval I2 D I\ between two local minima of W(t) and such that 
it includes exactly two local maxima of W(t). This extension can be done in exactly two 
ways: by adding the local maximum to the right or to the left of the initial one. The 
direction of extension does not affect the final result. The interval I2 corresponds to the 
tree T2 obtained from T\ by adding the second leaf. Continue the extension procedure by 
adding another adjacent local maximum: it will correspond to interval I3 and the tree T3 
obtained from T2 by adding one leaf. In general, each interval I n of the white noise that 
contains n consecutive local maxima corresponds to a unique branch T n of with n leaves. 
If we extend this interval to I n +i by adding values to the right (of left) to include n + 1 
local maxima, then I n C I n +\ and the tree T n+1 that correspond to I n +\ is obtained from 
T n by adding a single rightmost (leftmost) leaf. The precise position of this leaf within the 
rightmost (leftmost) branch of T n is determined by the values of local minima within I n +i- 
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One can also construct using the so-called forest of trees attached to a floor line 
[3T] . Namely, we start at any local minimum of the white noise and divide the trajectory 
into infinite sequence of finite vertically shifted positive excursions separated by intervals of 
monotone decrease, as discussed in [331 Section 5]. This procedure is illustrated in Fig. [3j 
Each excursion corresponds to a finite tree. The infinite tree is constructed by connecting 
these finite trees together as illustrated in Fig. [3} The resulting tree is the same as the one 
constructed above by adding individual leaves. 

We can summarize this discussion in the following theorem. 

Theorem 3. The Kingman's coalescent is combinatorially equivalent to the level-set tree of 
infinite white noise with discrete time and continuous values. 

Now we can extend the definitions of branch statistics relevant to the Horton and Toku- 
naga analyzes to an infinite Kingman's tree. Let us fix an arbitrary local minima of a white 
noise and count the other local maxima to the right and left of the chosen one using positive 
and negative integers k ^ 0, respectively. Let N rt k, fc e Z, ^ 0, r > 1 denotes the number 
of branches of Horton- Strahler order r in the tree that corresponds to the interval of the 
white noise between two local minima and such that it includes all local maxima between 
—k and k, and only those. 

We notice that 



which provides an intuitive interpretation for the quantities M r in terms of an infinite tree 
that corresponds to the Kingman's coalescent. Clearly, our construction is another way 
to obtain the Kingman's coalescent as the limit of the Marcus-Lushnikov process with N 
particles as iV — > oo, in terms of the corresponding trees. 

8 Numerical results 

This section illustrates the theoretical results of the paper and provides further insight into 
self-similarity of Kingman's coalescent in a series of numerical experiments. 

8.1 Horton self-similarity 

Figure [4] shows (by shaded circles) the asymptotic proportion A4 of branches of order k in 
the Kingman's coalescent according to the equation 



tations are done in Matlab, using the nominal absolute tolerance of e = 1CT 16 on an irregular 
time grid with steps decreasing towards 1 and being as small as A = 10~ 10 . The values of 
the estimated A4 as well as the consecutive ratios A4+i/Afc are reported in Table 1 with 7 
digit precision. The values of Afk are exponentially decreasing in such a way that the ratio 





(13) 



The integral is evaluated using the numeric solution hk(x) to the system (10). The compu 
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jVfe+i/Afc quickly converges to R = 0.328533 .... This ratio corresponds to the geometric 
decay 

A4 = (0.328533. . = lo^" 1 ) 10 ^ - 328533 --) = xqM- 4834 -) 

A black line in the figure illustrates geometric decay with this rate and an arbitrary offset. 

The observed convergence of ratios is stronger than the root-convergence proven in our 
Theorem [TJ We believe that in fact the strongest geometric convergence (see Definition 1) 
is also holds for the sequence A4- 

Figure [4] and Table 1 report as well the number Wj. of branches of order k observed in a 
level-set tree of a synthetic white-noise of length iV = 2 18 = 262, 144 with Horton-Strahler 
order Q — 11. The values are reported with 3 significant digits. The agreement between the 
statistics obtained in asymptotic Kingman's coalescent and white-noise time series is very 
good, in accordance with our equivalence Theorem [2j We notice that the agreement is good 
event though we only considered a single white-noise tree; this hints at a rapid convergence 
of the Horton statistics as the tree size increases. 



Table 1: Statistics of order- /c branches for Kingman's coalescence (columns 2 and 3) and 

white noise (column 4). 



k 


A4 


A4+ 


i/A4 




1 


1.0000000 


0.3333333 


1.000 


2 


0.3333333 


0.3290606 


0.330 


3 


0.1096869 


0.32i 


35928 


0.109 


4 


0.0360423 


0.32f 


35402 


0.0356 


5 


0.0118413 


0.32f 


35344 


0.0116 


6 


0.0038903 


0.32S 


35338 


0.00392 


7 


0.0012781 


0.32S 


35337 


0.00134 


8 


0.0004200 


0.32S 


35337 


0.00043 


9 


0.0001380 


0.32; 


35337 


0.00014 


10 


0.0000453 


0.32; 


35337 


0.0000458 


11 


0.0000149 






0.0000114 



8.2 Tokunaga self-similarity 

To write out the equations for the Tokunaga indexes Ty, we observe that the asymptotic 
ratio of Nij to iV for i < j is given by the original ODE ^ as 



Mij = / 7}i(t)r}j(t)dt = / (gi(x) - g i+ i(x)) (g^x) - g j+ i(x)) dx. 



This can be rewritten using the rescaled equations in (j 10|) as 

A/i-ij-i = 2 



1 (hi(x) - h i+ i(x))(hj(x) - h j+1 (x)) 



h 2 (x) 



dx. 
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We now use ( 13 ) to obtain 



lim Tj_n = lim ^\ T 1,j 1 = 1 

1 



r (hi(x)-h i+1 (x))(h J (x)-h : j +1 (x)) , 
J h 2 (x) UX 



h 2 {x) 

(14) 



0} 

n \ 



hj (x) x 



h(x) 

x 



dx 



Table 2 reports the Tokunaga indexes evaluated numerically using (14). The integrals 
and functions h k {x) are evaluated in Matlab with nominal absolute tolerance e = 1CT 20 and 
using the same grid as in computing the Horton statistics. The values are reported here with 
4 digits precision. We notice that evaluation of Tokunaga indices for larger ij pairs faces 
numerical problems because of divergence of h(x), and the associated "bursts" of h k (x), at 
unity. 

The reported index values Ti^ +k converge to the limit T k as % increases. The convergence 
rate is very fast; the limit value, within the reported precision, is achieved for % > 4 or faster. 
The reported simulations suggest that the convergence rate increases with k. 

Table 2: Asymptotic Tokunaga indices TL for Kingman's coalescence 





j = 2 3 


4 


5 


6 


7 


8 


9 


i = 1 


0.8196 0.5687 


0.2641 


0.0993 


0.0342 


0.0114 


0.0038 


0.0012 


2 


0.8234 


0.5720 


0.2655 


0.0999 


0.0344 


0.0115 


0.0038 


3 




0.8232 


0.5724 


0.2657 


0.0999 


0.0344 


0.0115 


4 






0.8231 


0.5724 


0.2657 


0.0999 


0.0344 


5 








0.8231 


0.5724 


0.2657 


0.0999 


6 










0.8231 


0.5724 


0.2657 


7 












0.8231 


0.5724 


8 














0.8231 



Figure [5] shows by shaded circles the "limit" Tokunaga indices T^g, i = 1, ... ,8, as a 
function of k — 9 — i. The figure suggests that the Tokunaga indices form a geometric series, 
asymptotic in k. Summing up, we conjecture that 

T k 

hm T M+fc =: T k and hm -7— = a 

i— >oo k— »oo C 1 

with a w 3 ± 0.02, c ~ —0.5 ± 0.02 obtained by fitting a geometric series to the last couple of 
"limit" Tokunaga indices. It must be noted that the conventional Tokunaga self-similarity 
assumes the geometric form of T k starting from k — 1. Hence, some of the results that hold for 
Tokunaga self-similar trees will not hold for the asymptotically Tokunaga trees. This applies 
for instance to the relation ([5]). Interestingly, this relation is closely satisfied, with less than 
0.01 relative error, for R = 0.328533 from the above section and (a, c) = (0.8230, 0.6953) that 
are obtained by fitting a geometric series to the first two "limit" Tokunaga indices Tg^ w T\ 
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and T7 9 m T 2 . This reflects the fact that the side-branching is rapidly fading off with k and 
the most important side-branches, that in effect form the entire tree, are those with k < 2. 

We also show in Fig. [5J by black squares, the Tokunaga indices Tk estimated for the 
level-set trees of discrete white noise. The values Tk are obtained by averaging the empirical 
Tokunaga indices in 100 trees of size N = 2 17 = 131, 072; the typical order of such trees is 
Q = 11. The indices are averaged over different trees and over all pairs of subindices {i, i + k} 
with i / 1. The empirical white- noise generated values match very closely the asymptotic 
values from Kingman's process, in accordance with our equivalence Theorem [2] 



9 General coalescent processes 

The ODE approach introduced in this paper can be extended to the coalescent kernels other 
than K(i,j) = 1. For that we need to classify the average number of clusters of order j per 
unit volume, Nj(t), at time t according to cluster masses. Namely, let T]j,k{t) ^ e ^ ne avera g e 
number of clusters of order j and mass k > 2 J per unit volume, at time t. Then 

00 

fc=2J 

The average number of clusters of mass k per unit volume Xk(t) = Y2Vj,k{t) at time t 

j 

can be determined using the original Smoluchowski equations Q. As for the Smoluchowski- 
Horton ODEs, in the case of a general coalescent process with a symmetric rate kernel 
K(i,j) = K(j,i) these ODEs can be written asymptotically as 



j-l fc-2 8 



^j.Vj,k{t) = fy\is(*)»7i,k-K- K '(«> k- k) (15) 

i=l K =2i 



+ \ Yl Vj-i,k 1 ( t )Vj-i,k2(t) K (ki,k 2 ) 



2 

kiM>2i~ 



k=2i 



,i=l 



with the initial conditions ^i,i(0) = 1 and %,fc(0) = for all (j, k) 7^ (1, 1' 



Observe that when K(i,j) = 1, summing the above equations (15) over index k produces 
the Smoluchowski- Horton ODE Q for the average number of order-j branches rjj (t) per unit 
volume in the Kingman's coalescent process. 



10 Discussion 

This paper establishes Horton self-similarity and states a numerical conjecture about the 
asymptotic Tokunaga self-similarity for a celebrated Kingman's coalescent, which is among 
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the most studied probabilistic models of branching structures. We also establish Horton 
and asymptotic Tokunaga self-similarity for a tree representation of white noise. Combining 
the results of this study with that of Burd et al. [4] and Zaliapin and Kovchegov [49J, one 
concludes that Horton and Tokunaga self-similarity are intrinsic properties of (i) white noise, 
(ii) random walk, (iii) critical binary Galton- Watson process, and (iv) Kingman's coalescent. 
The listed processes are believed to closely depict actual physical and biological mechanisms 
of diverse origin; they are hence among the essential building blocks in natural and computer 
sciences modeling. The results of this study and those in jH 09] thus provide at least a partial 
explanation for the omnipresence of Horton and Tokunaga self-similarity in observed and 
modeled branching structures. 

Our Theorem [I] establishes a weak, root-Horton law (see Definition la), while we believe 
that the two stronger versions (ratio and geometric) are also valid. It seems important to 
show rigorously at least the ratio- Horton law, because (a) the ratio-Horton law is commonly 
used in the applied literature, and (b) the ability to work with asymptotic ratios is neces- 
sary to tackle the Tokunaga self-similarity, which provides much stricter constraints on a 
branching structure. 

The Smoluchowski-Horton equations ^ that form a core of the presented method and 
their asymptotic equivalents ^ and (10) seem to be promising for further more detailed 
exploration. Indeed, one may hope that the approach that refers explicitly to the Horton- 
Strahler orders might effectively complement conventional analysis of cluster masses. The 
analysis of the Smoluchowski-Horton systems can be done within the ODE framework, sim- 
ilarly to the present study, or within the nonlinear iterative system framework. The latter 
approach is still to be explored. 

Finally, it is noteworthy that the analysis of multiplicative and additive coalescents ac- 
cording to the general Smoluchowski-Horton system (15) appears, after a certain series of 
transformations, to follow many of the steps implemented in this current paper for the King- 
man's coalescent, with the ODE system being replaced by a suitable PDE one. These results 
will be published elsewhere. 
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(a) Horton-Strahler orders (b)Tokunaga indices 

Figure 1: Example of (a) Horton-Strahler ordering, and of (b) Tokunaga indexing. Two 
order-2 branches are depicted by heavy lines in both panels. The Horton-Strahler orders 
refer, interchangeably, to the tree nodes or to their parent links. The Tokunaga indices refer 
to entire branches, and not to individual vertices. 




( a ) Function^ (b) Tree level(X) 



Figure 2: Function X t (panel a) with a finite number of local extrema and its level-set tree 
level (X) (panel b). 
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Figure 3: Illustration of tree construction for an infinite time series. The time series X t 
(shown on the left) is divided here into two vertically shifted excursions, marked A and B in 
the time axis, and one fall, depicted by the heavy segment on the time axis. The excursions 
correspond to the two trees represented by marked triangles on the right. 




Figure 4: Horton self-similarity. Filled circles: The asymptotic ratio A4 of the number N k of 
branches of order k to N in Kingman's coalescent, as N — > oo. Black squares: The empirical 
ratio N k /Ni in a level-set tree for a single trajectory of a white noise of length iV = 2 18 . 
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Figure 5: Asymptotic Tokunaga self-similarity. Filled circles: The asymptotic Tokunaga 
indices Tjo, in Kingman's coalescent, as N — > oo. Black squares: The empirical Tokunaga 
indices averaged over 100 level-set trees for white noises of length N = 2 17 . 
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